DAEMON: Dataset/Platform-Agnostic Explainable Malware Classification Using Multi-Stage Feature Mining
نویسندگان
چکیده
Numerous metamorphic and polymorphic malicious variants are generated automatically on a daily basis by mutation engines that transform the code of program while retaining its functionality, in order to evade signature-based detection. These automatic processes have greatly increased number malware variants, deeming their fully-manual analysis impossible. Malware classification is task determining which family new variant belongs. Variants same show similar behavioral patterns. Thus, classifying newly discovered programs applications helps assess risks they pose. Moreover, facilitates should undergo manual security expert, determine whether belong (e.g., one whose members exploit zero-day vulnerability) or simply result concept drift within known family. This motivated intense research recent years devising high-accuracy tools for classification. In this work, we present DAEMON - novel dataset-agnostic classifier. A key property type features it uses manner mined facilitate understanding distinctive behavior families, making decisions explainable. We've optimized using large-scale dataset x86 binaries, belonging mix several families targeting computers running Windows. We then re-trained applied it, without any algorithmic change, feature re-engineering parameter tuning, two other datasets Android consisting numerous families. obtained highly accurate results all datasets, establishing also platform-agnostic.
منابع مشابه
Feature Selection for Malware Classification
In applying machine learning to malware identification, different types of features have proven to be successful. These features have also been tested with different kinds of classification methodologies and have had varying degrees of success. Every time a new machine learning methodology is introduced for classifying malware, there is the potential for increasing the overall quality of malwar...
متن کاملCAMP: Content-Agnostic Malware Protection
--In spite of recent advances, the world wide web remains an important vector for malware installation. Approaches to evaluating potentially malicious code before execution in a browser, such as blacklisting or content-based detection are hindered by an attacker’s ability to easily change hosting domains or mutate malware binaries. On the other hand, whitelistbased approaches are challenged by ...
متن کاملMalware Detection using Classification of Variable-Length Sequences
In this paper, a novel method based on the graph is proposed to classify the sequence of variable length as feature extraction. The proposed method overcomes the problems of the traditional graph with variable length of data, without fixing length of sequences, by determining the most frequent instructions and insertion the rest of instructions on the set of “other”, save speed and memory. Acco...
متن کاملUsing Multi-Feature and Classifier Ensembles to Improve Malware Detection
With the rapid growth of internet application, malware has become one of the major threats to information security. Traditionally, anti-virus products use signature matching to detect malware, but the drawback is that they can not detect new and unknown malware. Recent studies showed that the use of machine learning can successfully detect new and unknown malware, but the limitation of this tec...
متن کاملFeature Reduction to Speed Up Malware Classification
In statistical classification work, one method of speeding up the process is to use only a small percentage of the total parameter set available. In this paper, we apply this technique both to the classification of malware and the identification of malware from a set combined with cleanware. In order to demonstrate the usefulness of our method, we use the same sets of malware and cleanware as i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2021
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2021.3082173